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Abstract 

We study the approximability of a number of graph problems: treewidth and pathwidth of 
graphs, one-shot black (and black-white) pebbling costs of directed acyclic graphs, and a variety 
of different graph layout problems such as minimum cut linear arrangement and interval graph 
completion. We show that, assuming the recently introduced Small Set Expansion Conjecture, 
all of these problems are hard to approximate within any constant factor. 
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1 Introduction 



One of the great accomplishments in the last twenty years in complexity theory has been the 
development of ideas that has led to a deep understanding of the approximability of an astonishing 
number of NP-hard optimization problems. More recently, in the last ten years, the formulation 
of the Unique Games Conjecture (UGC) due to Khot |Kho02j has inspired a remarkable body of 
work, clarifying the complexity of many optimization problems, and exposing the central role of 
semidefinite programming in the development of approximation algorithms. 

Despite this tremendous progress, for certain expansion problems such as the c-Balanced Sep- 
arator problem, and graph layout problems such as the Minimum Linear Arrangement (MLA) 
problem, their approximation status remained unresolved. That is, even assuming the UGC is not 
known to be sufficient to obtain hardness of approximation for either of these problems. Moreover, 
the approximability of many other graph layout problems is similarly unresolved, even under the 
UGC. Intuitively this is because the hard instances for these problems seem to require a certain 
global structure such as expansion. Typical reductions for these problems are gadget reductions 
which preserve global properties of the unique games instance, such as the lack of expansion. 
Therefore, barring radically new types of reductions that do not preserve global properties, proving 
hardness for c-Balanced Separator seems to require a stronger version of UGC, where the instance 
is guaranteed to have good expansion. 

In |RS10j . the Small Set Expansion (SSE) Conjecture was introduced, and it was shown that 
it implies the UGC, and that the SSE Conjecture follows if one assumes that the UGC is true for 
somewhat expanding graphs. In follow-up work by Raghavendra et al. |RST10j . it was shown that 
the SSE Conjecture is in fact equivalent to the UGC on somewhat expanding graphs, and that 
the SSE Conjecture implies hardness of approximation for c-Balanced Separator and MLA. In this 
light, the Small Set Expansion conjecture serves as a natural unified conjecture that yields all of 
the implications of UGC and also hardness for expansion-like problems that appear to be beyond 
the reach of the UGC. 

In this paper, we study the approximability of a host of such graph layout problems, including: 
treewidth and pathwidth of graphs, one-shot black and black- white pebbling. Minimum Cut Linear 
Arrangement (MCLA) and Interval Graph Completion (IGC). We prove that all of these problems 
are SSE-hard to approximate to within any constant factor. Our main contributions, giving SSE- 
hardness of approximation for all of the graph layout problems mentioned above, are described in 
the following subsections. For all of these problems, no evidence of hardness of approximation was 
known prior to our results. 

It should be noted that the status of the SSE Conjecture is very open at this point. In particular, 
by the recent result of Arora et al. [ABSIO] (see also subsequent work |BRSlll IGSllj ). it has 
algorithms running in subexponential time. Still, despite this recent progress providing negative 
evidence against the SSE Conjecture, it remains open, and we think that investigating what open 
problems in approximability we can show SSE-hardness for is a worthwhile venture. 

1.1 Width Parameters of Graphs 

The treewidth of a graph, introduced by Robertson and Seymour |RS84l IRS86j . is a fundamental 
parameter of a graph that measures how close a graph is to being a tree. The concept is very impor- 
tant since problems of small treewidth can usually be solved efficiently by dynamic programming. 
Indeed, a large body of NP-hard problems (including all problems definable in monadic second-order 
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logic |Cou90j ) are solvable in polynomial time and often even linear time on graphs of bounded 
treewidth. Examples of such optimization problems include finding the maximum independent set 
in a graph, as well as finding Hamiltonian cycles. In machine learning, tree decompositions play a 
key role in the development of efficient algorithms for fundamental problems such as probabilistic 
inference, constraint satisfaction and query optimization. (See the excellent survey |Bod05] for 
motivation, including theoretical as well as practical applications of treewidth.) 

The complexity of approximating treewidth is a longstanding open problem. Determining the 
exact treewidth of a graph and producing an associated optimal tree decomposition (see Defi- 
nition 2.4) is known to be NP-hard |ACP87j . A central open problem is to determine whether 
or not there exists a polynomial time constant factor approximation algorithm for treewidth (see 
e.g., fBGHK95'. ' FKLOSf IBodOS] ). The current best polynomial time approximation algortihm for 
treewidth [FHL05], computes the treewidth tw(G) within a factor 0(Y^logtw(G)). On the other 
hand, the only hardness result to date for treewidth shows that it is NP-hard to compute treewidth 
within an additive error of for some e > |BGHK95] . No hardness of approximation is known 
and not even the possibility of a polynomial-time approximation scheme for treewidth has been 
ruled out. In many important special classes of graphs, such as planar graphs |ST94j . asteroidal 
triple- free graphs |BT03j . and ff- minor- free graphs |FHL05| . constant factor approximations are 
known, but the general case has remained elusive. 

On the positive side, there is a large body of literature developing fixed-parameter algorithms 
for treewidth. In particular, when the runtime is allowed to be exponential in the tw(G) there 
are constant factor approximations. Furthermore, even exactly determining the treewidth is fixed- 
parameter tractable: there is a linear time algorithm for computing the (exact) treewidth for graphs 
of constant treewidth |Bod96| . 

A related graph parameter is the so-called pathwidth, which can be viewed as measuring how 
close G is to a path. The pathwidth pw(G) is always at least tw(G), but can be much larger. 
The current state of affairs here is similar as for treewidth; though the current best approximation 
algorithm only has an approximation ratio of 0( y^log pw(G) log n) |FHL05] . the best hardness 
result is NP-hardness of additive error approximation. 

Using the recently proposed Small Set Expansion (SSE) Conjecture |RS10j discussed earlier, we 
show that both tw(G) and pw(G) are hard to approximate within any constant factor. In fact, we 
show something stronger: it is hard to distinguish graphs with small pathwidth from graphs with 
large treewidth. Specifically: 

Theorem 1.1. For every a > 1 there is a c > such that given a graph G = {V, E) it is SSE-hard 
to distinguish between the case when pw(G) < c • |y| and the case when tw(G) > a ■ c - \ V\. 

In particular, both treewidth and pathwidth are SSE-hard to approximate within any constant 
factor. 

This is the first result giving hardness of (relative) approximation for these problems, and gives 
evidence that no constant factor approximation algorithm exists for either of them. 



1.2 Pebbling Problems 

Graph pebbling is a rich and relatively mature topic in theoretical computer science. Pebbling is 
a game defined on a directed acyclic graph (DAG), where the goal is to pebble the sink nodes of 
the DAG according to certain rules, using the minimum number of pebbles. The rules for pebbling 
are as follows. A black pebble can be placed on a node if all of the node's immediate predecessors 
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contain pebbles, and can always be removed. A white pebble can always be placed on a node, but 
can only be removed if all of the node's immediate predecessors contain pebbles. A pebbling strategy 
is a process of pebbling the sink nodes in a graph according to the above rules. The pebbling cost 
of a pebbling strategy is the maximum number of pebbles used in the strategy. The black-white 
pebbling cost of a DAG is the minimum pebbling cost of all possible pebbling strategies. The black 
pebbling cost is the minimum pebbling cost over all pebbling strategies that only use black pebbles. 

Pebbling games were originally devised for studying programming languages and compiler con- 
struction, but have later found a broad range of applications in computational complexity theory. 
Pebbling is a tool for studying the relationship between computation time and space by means of 
a game played on directed acyclic graphs. It was employed to model register allocation, and to 
analyze the relative power of time and space as Turing machine resources. For a comprehensive 
recent survey on graph pebbling, see |NorlOj . 

Apart from the cost of a pebbling, another important measure is the pebbling time, which is the 
number of steps (pebble placements/removals) performed. In the context of measuring memory 
used by computations, this corresponds to computation time, and hence keeping the pebbling time 
small is a natural priority. The extreme case of this is what we refer to as one-shot pebbling, also 
known as progressive pebbling, considered in e.g. |Set73[ iLenSll lKP86j . In one-shot pebbling, we 
have the restriction that each node can receive a pebble only once. Note that this restriction can 
cause a huge increase in the pebbling cost of the graph |LT82j . One-shot pebbling is also equivalent 
to a problem known as Register Sufficiency |RAK9lj . 

The one-shot pebbling problem is easier to analyze for the following reasons. In the original 
pebbling problem, in order to achieve the minimum pebbling number, the pebbling time might be 
required to be exponentially long, which becomes impractical when n is large. On the other hand, 
the one-shot pebbling problem is more amenable to complexity theoretic analysis as it minimizes 
the space used in a computation subject to the execution time being minimum. In particular, the 
decision problem for one-shot pebbling is in NP (whereas the unrestricted pebbling problems are 
PSPACE-complete). 

The one-shot black pebbling problem and one-shot black-white pebbling problems admit an 
0(-v/Iognlogn) approximation ratio. We show that they are SSE-hard to approximate to within 
any constant factor. For black pebbling we show that this holds for single sink DAGs with in- 
degree 2, which is the canonical setting for pebbling games (it seems plausible that the black- white 
hardness can be shown to hold for this case as well, though we have not attempted to prove this). 

Theorem 1.2. It is SSE-hard to approximate the one-shot black pebbling problem within any con- 
stant factor, even in DAGs with a single sink and maximum in- degree 2. 

Theorem 1.3. It is SSE-hard to approximate the one-shot black-white pebbling problem within any 
constant factor. 

No hardness of approximation result of any form was known for one-shot pebbling problems. We 
believe that these results can be extended to obtain hardness for more relaxed versions of bounded 
time pebbling costs as well. We are currently working on this, and have some preliminary results. 

1.3 The Connection: Layout Problems 

The graph width and one-shot pebbling problems discussed in the previous sections may at first 
glance appear to be unrelated. However, both sets of problems are instances of a general family 
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of problems, known as graph layout problems. In a graph layout problem (also known as an ar- 
rangement problem, or a vertex ordering problem), the goal is to find an ordering of the vertices, 
optimizing some condition on the edges, such as adjacent pairs being close. Layout problems are 
an important class of problems that have applications in many areas such as VLSI circuit design. 

A classic example is the Minimum Cut Linear Arrangement Problem (MCLA). In this problem, 
the objective is to find a permutation vr of the vertices V of an undirected graph G = (V, E), such 
that the largest number of edges crossing any point, 

max|{(n, f) G E'|7r(n) <i< ■k{v)}\, (1) 

i 

is minimized. MCLA is closely related to the Minimum Linear Arrangement Problem (MLA), in 
which the max in ([T| is replaced by a sum. 

The MCLA problem can be approximated to within a factor 0(log nyTogn). To the best of 
our knowledge, there is no hardness of approximation for MCLA in the literature. Its cousin MLA 
was recently proved SSE-hard to approximate within any constant factor |RST10| . and we observe 
that the same hardness applies to the MCLA problem. 

Theorem 1.4. Assuming the SSE Conjecture, Minimum Cut Linear Arrangement is hard to ap- 
proximate within any constant factor. 

Another example of graph layout is the Interval Craph Completion Problem (IGC). In this 
problem, the objective is to find a supergraph G' = {V,E') of G such that G' is an interval graph 
(i.e., the intersection graph of a set of intervals on the real line) and of minimum size. While not 
immediately appearing to be a layout problem, using a simple structural characterization of interval 
graphs [RR88j one can show that IGC can be reformulated as finding a permutation of the vertices 
that minimizes the sum over the longest edges going out from each vertex, i.e., minimizing 
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max max{7r(t)) — 7r(M), 0}. (2) 



See e.g., [CHKRIO] . The current best approximation algorithm for IGC achieves aratio of O ( \/log n log log n) 
ICHKRlOj . It turns out that the SSE Conjecture can be used to prove super-constant hardness for 
this problem as well. 

Theorem 1.5. Assuming the SSE Conjecture, Interval Graph Completion is hard to approximate 
within any constant factor. 

There is a distinction in IGC of whether one counts the number of edges in the final interval 
graph - this is the most common definition - or whether one only counts the number of edges added 
to make G an interval graph (which makes the problem harder from an approximability viewpoint). 
Our result holds for the common definition and therefore applies also to the harder version. 



Theorems 1.4 and 1.5 are just two examples of layout problems that we prove hardness of 
approximation for. By varying the precise objective function and also considering directed acyclic 
graphs, in which case the permutation vr must be a topological ordering of the graph, one can obtain 
a wide variety of graph layout problems. We consider a set of eight such problems, generated by 



three natural variations (see Section 2.1 for precise details), and show super-constant SSE-based 
hardness for all of them in a unified way. This set of problems includes MLA, MCLA, and IGC, but 
not problems such as Bandwidth (but on the other hand, strong NP-hardness inapproximability 
results for Bandwidth are already known [DFUllj ). See Table [T] in Section 2.1 for a complete list 
of problems covered. 
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Theorem 1.6. Assuming the SSE Conjecture, all problems listed in Table^ (see page^ are hard 
to approximate to within any constant factor. 

Let us now return to the problems discussed in the previous sections. It should not be surprising 
that the one-shot black pebbling problem is equivalent to a graph layout problem: the one-shot 
constraint reduces the problem to determining in which order to pebble the vertices; such an 
ordering induces a pebbling strategy in an obvious way. For the black-white case, it is known 
that the one-shot black-white pebbling cost of D is interreducible with a layout problem on an 
undirected graph G. Both of these layout problems are included in the set of problems we show 



hardness for, so Theorems 1.2 and 1.3 follow immediately from Theorem 1.6 

Turning to the width parameters, treewidth is equivalent to a graph layout problem called 
elimination width. Here the objective function is somewhat more intricate than in the set of basic 



layout problems we consider in Theorem 1.6, but we are able to extend those results to hold also 
for elimination width. Pathwidth is also known to be equivalent to a certain graph layout problem, 
and in fact is equivalent to the layout problem which one-shot black-white pebbling reduces to. We 
use these connections to prove the hardness of approximation for both treewidth and pathwidth, 
thereby obtaining Theorem |1.1[ 



1.4 Previous Work 

As the reader may have noticed, for all the problems mentioned, the best current algorithms 
achieve similar poly-logarithmic approximation ratios. Given their close relation, this is of course 
not surprising. Most of the algorithms are obtained by recursively applying some algorithm for the 
c-balanced separator problem, in which the objective is to find a bipartition of the vertices of a 
graph such that both sides contain a c fraction of vertices, and the number of edges crossing the 
partition is minimized. 

In the pioneering work on separators by Leighton and Rao |LR99j . an O(logn) approximation 
algorithm for c-balanced separator was given, which was used to design 0(log^ n) approximation 
algorithm for a number of graph layout problems such as MLA, MCLA, and Register Sufficiency. 
Later, |RR98j improved the approximation algorithm for MLA to a ratio O(lognloglogn), using 
a spreading metric method. In the groundbreaking work of Arora et al. |ARV09j . semidefinite 
programming was used to give an improved approximation ratio of 0{y/log n) for c-balanced sep- 
arator. Using their ideas, improved algorithms for ordering problems have been found, such as 
the O ( \/log n log log n) approximation algorithm for IGC and MLA [ CHKR10| . the 0{\/log n) ap- 
proximation algorithm for treewidth |FHL05j and the 0(\/lognlogn) approximation algorithm for 
pathwidth |FHLn5j . 

It is known that the register sufficiency problem (also known as one-shot black pebbling) admits 
a O(log^n) approximation algorithm |RAK9lj . We observe that by plugging in the improved 
approximation algorithm for direct vertex separator [ACMMOS] into the algorithm in |RAK91j . 
one can improve this to an 0(\/lognlogn) approximation algorithm. 

Again, in these algorithms, the approximation algorithm for c-balanced separator plays a key 
role. An improved algorithm for c-balanced separator will also improve the approximation algo- 
rithms for the other problems. On the other hand, hardness of approximating c-balanced separator 
|RST10j does not necessarily imply hardness of approximating layout problems. 

On the hardness side, our work builds upon the work of ( RSTlOj . which showed that the 
SSE Conjecture implies superconstant hardness of approximation for MLA (and for c-balanced 
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separator). The only other hardness of relative approximation that we are aware of for these 
problems is a result of Ambiihl et al. [AMS07], showing that MLA does not have a PTAS unless 
NP has randomized subexponential time algorithms. 



1.5 Organization 

The outline for the rest of the paper is as follows. In Section [2| we formally define the layout 
problems studied as well as treewidth and pathwidth. Section [3] gives an overview of the reductions 
used. Then Section |4] gives the reductions proving Theorem |1.6[ Section [5] we use that to prove 
Theorem and in Section |6] we give some additional reductions for our pebbling instances in 



order to achieve indegree 2 and single sinks, as promised in Theorem 1.2, Finally we end with some 
concluding remarks and open problems in Section [7j 

2 Definitions and Preliminaries 
2.1 Graph Layout Problems 

In this section, we describe the set of graph layout problems that we consider. A problem from 
the set is described by three parameters, giving rise to several different problems. These three 
parameters are by no means the only interesting parameters to consider (and some of the settings 
give rise to more or less uninteresting layout problems). However, they are sufficient to capture the 
problems we are interested in except treewidth, which in principle could be incorporated as well 



though we refrain from doing so in order to keep the definitions simple (see Section 2.2 for more 
details) . 

First a word on notation. Throughout the paper, G = (F, E) denotes an undirected graph, 
and D = {V, E) denotes a directed (acyclic) graph. Letting n denote the number of vertices of 
the graph, we are interested in bijective mappings -k : V ^ [n\. We say that an edge {u,v) G E 
crosses point i G [n] (with respect to the permutation vr, which will always be clear from context), 
if 7r(u) < i < 7r(f). 

We consider the following variations: 

1. Undirected or directed acyclic: In the case of an undirected graph G, any ordering vr of 
the vertices is a feasible solution. In the case of a DAG only the topological orderings of 
D are feasible solutions. 

2. Counting edges or vertices: for a point i G [n] of the ordering, we are interested in the 
set Ei{T[) of edges crossing this point. When counting edges, we use the cardinality of Ei as 
our basic measure. When counting vertices, we only count the set of vertices Vi to the left of 
i that are incident upon some edge crossing i. In other words, Vi is the projection of Ei^n) 
to the left-hand side vertices. Formally: 

Ei{n) = {e £ E \ -ir{u) < i < Tr{v) where e = (n, v)} 
Vi{TT) = {u £ V \ 7r(u) < i < 7r{v) for some {u, v) £ E} 

We refer to |-E'i(vr)| or |Vi(7r)| (depending on whether we are counting edges or vertices) as 
the cost of vr at i. 
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3. Aggregation by sum or max: given an ordering tt, we aggregate the costs of each point 
i £ [n], by either summation or by taking the maximum cost. 



Given these choices, the objective is to find a feasible ordering tt that minimizes the aggregated 
cost. 

Definition 2.1. (Layout value) For a graph H (either an undirected graph G or a DAG D), a cost 
function C (either E or V), and an aggregation function agg : M* — >■ M (either S or max) , we define 
Layout(i7; C, agg) as the minimum aggregated cost over all feasible orderings of H. Formally: 



Layout(i/; C, agg) = min agg |Ci(7r)|. 

feasible n jg[„] 

Example 2.2. 

Layout(G; ii^, max) = minmax |£'j(7r)|, 

71" ie[ra] 



where vr ranges over all orderings ofV{G). This we recognize from Section 1.3 as the Minimum 
Cut Linear Arrangement value of G. 

Example 2.3. 

Layout(-D; max) = min max | V^, (vr) | , 



where vr ranges over all topological orderings of the DAG D. As we shall see in Section 2.3, this is 
precisely the One-Shot Black Pebbling cost of D. 

Combining the different choices gives rise to a total of eight layout problems (some more natural 
than others). Several of these appear in the literature under one or more names, and some turn 
out to be equivalent]^ to problems that at first sight appear to be different. We summarize some of 
these names in Table [l} In some cases the standard definitions of these problems look somewhat 
different than the definition given here (e.g., for pathwidth, one-shot pebblings, and interval graph 
completion). For the pebbling and pathwidth problems, we discuss these equivalences of definitions 
in the following two sections. 

For interval graph completion, recall from Section 1.3 that the objective is to minimize 

> max max{7r(f ) — 7r(n), 0}. 

In other words, we are counting the longest edge going to the right from each point i. If the length 
of this edge is / then the edge contributes 1 to Vi{7r), . . . , Vi+i-i(vr) and hence the objective can be 
rewritten as 

SO that Interval Graph Completion is precisely Layout(G; V, S). 

^Here, we consider two optimization problems equivalent if there are reductions between them that change the 
objective values by at most an additive constant. 
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Problem 


Also known as / Equivalent vi^ith 


undir. 


edge 


sum 


Minimum/ Optimal Linear Arrangement 


undir. 


edffe 


max 


Minimum Cut Linear Arrangement 
CutWidth 


undir. 


vertex 


sum 


Interval Graph Completion 
SumCut 


undir. 


vertex 


max 


Pathwidth 

One-shot Black- White Pebbling 
Vertex Separation 


DAG 


edge 


sum 


Minimum Storage-Time Sequencing 
Directed MLA/OLA 


DAG 


edge 


max 




DAG 


vertex 


sum 




DAG 


vertex 


max 


One-shot Black Pebbling 
Register Sufficiency 



Table 1 : Taxonomy of Layout Problems 



2.2 Treewidth, Elimination Width, and Pathwidth 

Definition 2.4 (Tree decomposition, Treewidth). Let G = {V,E) be a graph, T a tree, and let 
V = {Vt)t£T be a family of vertex sets Vj C 1/ indexed by the vertices t of T. The pair (T, V) is 
called a tree decomposition of G if it satisfies the following three conditions: 

(Tl) V = UteTVt; 

(T2) for every edge e G E, there exists a t G T such that both endpoints of e lie in Vt; 

(T3) for every vertex v£V,{t£T\v£ Vt} is a subtree of T\ 

The width of (T, V) is the number max{| Vt| — l\ t £ T}, and the treewidth of G, denoted tw(G), 
is the minimum width of any tree decomposition of G. 

Definition 2.5. Let G = {V,E) be a graph, and let vi,. . . ,Vn be some ordering of its vertices. 
Consider the following process: for each vertex Vi in order, add edges to turn the neighborhood of 
Vi into a clique, and then remove Vi from G. This is an elimination ordering of G. The width of 
an elimination ordering is the maximum over all Vi of the degree of Vi when Vi is eliminated. The 
elimination width of G is the minimum width of any elimination order. 

Theorem 2.6 (See e.g., |Bod07p . For every graph G, the elimination width of G equals tvj{G). 
Thus treewidth is another example of a layout problem. In principle this layout problem can 



be formulated in the framework of Section 2.1 , but the choice of cost function is now more involved 



than the vertex- and edge-counting considered there. 

Definition 2.7 (Path decomposition, Pathwidth). Given a graph G, we say that (T, V) is a path 
decomposition of G if it is a tree decomposition of G and T is a path. The pathwidth of G, denoted 
pw(G), is the minimum width of any path decomposition of G. 
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As claimed earlier, pathwidth is in fact equivalent with a graph layout problem: 

Theorem 2.8 ( jKin92j ). For every graph G, we have p\N{G) = Layout(G; max), also known 
(among many other names) as the "vertex separation" number of G. 

2.3 Pebbling Problems 

In this section we define pebbling problems and their one-shot versions. 

Definition 2.9. (Pebbling Configurations) Let D = {V,E) be a directed acyclic graph (DAG). A 
pebbling configuration of D is a pair {B, W) of (disjoint) subsets vertices (representing the set B of 
vertices that have black pebbles, and the set W of vertices that have white pebbles on them). 

Definition 2.10. (Black and Black- White Pebbling Strategies) Let D = {V, E) be a directed acyclic 
graph. A black-white pebbling strategy for D is a sequence of pebble configurations V = {Pq, . • • , -Pt} 
such that: 

(i) the first and last configurations contain no pebbles; that is Pq = Pr = (0, 0). 

(ii) each sink vertex u of D is pebbled at least once, i.e., there is some Pt = {Bt,Wt) such that 
u€ BtU Wt- 

(iii) each configuration follows from the previous configuration by one of the following rules: 

(a) A black pebble can be removed from a vertex. 

(b) A black pebble can be placed on a pebble- free vertex v if all of the immediate predecessors 
of V are pebbled. 

(c) A white pebble can be placed on a pebble-free vertex. 

(d) A white pebble can be removed from a vertex v if all of the immediate predecessors of 
V are pebbled. 

A black pebbling strategy for G is a black-white pebbling strategy in which no white pebbles are 
used. 

The cost of a pebbling strategy is cost{V) = maxQ<t<r{\Bt U Wt\}. The black- white pebbling 
cost of D is the minimum cost of any black- white pebbling strategy of D, and similarly the black 
pebbling cost of D is the minimum cost of any black pebbling strategy of D. 

Definition 2.11. (One-Shot Black and One-Shot Black-White Pebbling) A one-shot black (resp. 
black-white) pebbling strategy is a black (resp. black- white) pebbling strategy in which each node 
is only pebbled once. The one-shot black (resp. black-white) pebbling cost of D, denoted BP^^(D) 
(resp. BWP^^(L')) is the minimum cost of any one-shot black (resp. black-white) pebbling strategy 
oiD. 

As mentioned in Table [T| the one-shot pebbling problems can be formulated as Layout problems. 
Lemma 2.12. For every DAG D = {V,E), we have EP^\D) = Layout(L', max). 
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Proof. Suppose vr is the optimal ordering of Layout(D, y, max), we pebble the vertices according 
to TT. We remove a pebble from vertex u if and only if all of the successors of u are pebbled. 
Since tt is a topological order of D, this is a valid pebbling strategy. It is easy to verify that after 
pebbling 7r(z), the number of pebbles on the graph is |Vi(7r)|. Therefore the number of pebbles used 
in the above strategy is Layout(D, max). On the other hand, suppose T is the optimal pebbling 
strategy, let a be the ordering of vertices to receive a pebble in T. We consider the number of 
pebbles on the graph after pebbling the i-th vertex in a. For any vertex u that has a pebble, if the 
vertex has a successor that has not yet be pebbled, then the pebble on u cannot be removed, since 
u cannot be pebbled again. Therefore the number of pebbles on the graph is at least Vi{a). Thus 
BP^'(-D) > maxig[„] \ Vi{a)\ > Layout(L», max). □ 

For one-shot black- white pebbling, we have the following reductions by Lengauer |Len81| , show- 
ing that one-shot black-white pebbling is equivalent to the undirected max-vertex layout problem. 

Lemma 2.13 ( |Len81| ). For a given DAG D = [V^E), let Go = (VjEd) be an undirected graph 
with Ed = {{v, w) \ (v, w) G E} U {{v, w) \ 3u, {v, u), {w, u) G E}. Then 

BWP^=(D) = Layout(GD,y,max) - 1. 

Lemma 2.14 ( ^Len81| ). For an undirected graph G = iV,E), let Dq = (V U E,Eg) be a DAG 
with Eq = {{v, e)\e & E,v G V,v £ e}. Then 

Layout(G, V, max) = B\NP^%Dg) + 2. 



2.4 Small Set Expansion Conjecture 

In this section we define the SSE Conjecture. Let G = {V, E) be an undirected d-regular graph. 
For a set S C y of vertices, we write ^g{S) for the (normalized) edge expansion of 5, 

\E{S,V\S)\ 
= 

The Small Set Expansion Problem with parameters r/ and 5, denoted SSE(?7,5), asks if G has a 
small set S which does not expand or whether all small sets are highly expanding. 

Definition 2.15 (SSE(77, 5)). Given a regular graph G = iV^E), SSE(?7, (5) is the problem of 
distinguishing between the following two cases: 

Yes There is an S" C y with |5| = 5\V\ and ^g[S) < r]. 

No For every S with \S\ = 6\V\ it holds that $g('S') > 1 - ry. 

This problem was introduced by Raghavendra and Steurer |RS10| . who conjectured that the 
problem is hard. 

Conjecture 2.16 (Small Set Expansion Conjecture). For every rj > 0, there is a 5 > such that 
SSE(7?,(5) is NP-hard. 
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As has become common for a conjecture like this (such as the Unique Games Conjecture), we 
say that a problem is SSE-hard if it is as hard to solve as the SSE problem. Formally, a decision 
problem V (e.g., a gap version of some optimization problem) is SSE-hard if there is some > 
such that for every (5 > 0, SSE(r/, 5) polynomially reduces to V. 

Subsequently, Raghavendra et al. [RSTIO] showed that the SSE Problem can in turn be reduced 
to a quantitatively stronger form of itself. To state this stronger version, we need to first define 
Gaussian noise stability. 

Definition 2.17. Let p G [-1, 1]. We define Tp : [0, 1] [0, 1] by 

Tpi^i) =Pr[X< $-i(/x) AY< $-i(^)] 



where X and Y are jointly normal random variables with mean and covariance matrix 



1 P 
P 1 



The only fact we shall need about Tp is the asymptotic behaviour for p close to 1 and /i bounded 
away from 0. 

Fact 2.18. There is a constant c > such that for all sufficiently small e and all fi G [1/10; 1/2], 

ri_,(/x) < fi{l - c^/e). 
We can now state the strong form of the SSE conjecture. 

Conjecture 2.19 (SSE Conjecture, Equivalent Formulation). For every integer q > and€,j > 0, 
it is NP-hard to distinguish between the following two cases for a given regular graph G = (V, E) 

Yes There is a partition of V into q equi-sized sets Si, . . . ,Sq such that ^ciSi) < 2e for every 
l<i<q. 

No For every S OV, letting = \S\/\V\, it holds that <^g{S) > 1 - {Ti-e/2ip) + 1)/ P- 

For future reference, let us make two remarks about the strong form of the conjecture. 
Remark 2.20. In the Yes case of Conjecture \2.19[ the number of edges leaving Si is at most 

\E{Si, V\Si)\= ^G{S^)d\S\ < Ae\E\/q. 
In particular, the total number of edges that are not contained in one of the Si 's is at most 

lj2\E{S,,V\S,)\<2e\E\. 

i 



Remark 2.21. Using Fact 2.18 we see that, in the No case of Conjecture 2.19, we have 

provided fj, € [1/10, 1/2] and setting 7 < ^/e. In particular, for every |T^|/10 < |5| < 9|y|/10, we 
have \E{S, V \ S)\ > Cy/e\E\ (switching roles of S and V \ S for \S\ > \V\/2), for some universal 
constant c (not the same constant as in Fact 2.18). 
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Figure 1: Overview of Reductions. Dashed arrows indicate that the reduction is obtained by the 
identity mapping, whereas soUd arrows indicate a nontrivial transformation from one problem to 
the other. 

3 Overview of Reductions 

We shall proceed as follows: for the two undirected edge problems (i.e., MLA and MCLA), the 
hardness follows immediately from the strong form of the SSE Conjecture (Conjecture 2.19[ ) 



for the case of MLA this was proved in [RSTlOj and the proof for MCLA is similar. We then 



give in Section 4.2 a simple reduction from MLA/MCLA to the four directed problems, and in 



Section 4.3 a similar reduction from MLA/MCLA to the two undirected vertex problem. The 



results for treewidth, which are presented in the next section, follows from an additional analysis 



of the instances produced by the reduction of Section 4.3 Unfortunately, the results do not follow 
from hardness for MLA/MCLA in a black-box way; for the soundness analyses we need to use the 
expansion properties of the SSE instance. 

We then give a reduction from MLA/MCLA with expansion, to the four directed problems. 
This reduction simply creates the bipartite graph where the vertex set is the union of the edges and 
vertices of the original graph G, with directed arcs from an edge e to the vertices incident upon 
e in G. The use of direction here is crucial: it essentially ensures that both the vertex and edge 
counts of any feasible ordering corresponds very closely to the number of edges crossing the point 
in the induced ordering of G. 

To obtain hardness for the remaining two undirected problems, we perform a similar reduction 
as for the directed case, creating the bipartite graph of edge-vertex incidences. However, since we 
are now creating an undirected graph, we can no longer force the edges to be chosen before the 
vertices upon which they are incident, which was a key property in the reduction for the directed 
case. In order to overcome this, we duplicate each original vertex a large number of times. This 
gives huge penalties to orderings which do not "essentially" obey the desired direction of the edges, 
and makes the reduction work out. 

The results for treewidth, which are presented in Section [5} follows from an additional analysis 
of the instances produced by the reduction for undirected vertex problems. Finally, the reduction 
for directed problems, implying hardness for one-shot black pebbling, does not produce the kind of 



"nice" instances promised by Theorem 1.2 In Section [6j we give some additional transformation 
to achieve these properties. 

Figure [T] gives a high-level overview of these reductions. 
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4 Hardness For Layout Problems 



In this section, we show that ah of the layout problems defined in Section 2.1 are SSE-hard to 
approximate within any constant factor. This also shows that pathwidth and the one-shot pebbling 
problems are hard to approximate within any constant. 



4.1 Hardness for MCLA and ML A 

In this section, we recall the proof of f RSTlOj for MLA, and observe that it applies for MCLA 
as weh. For an undirected graph G, let us write MCLA(G) (resp., MLA(G)) for the MCLA value 
(resp., MLA value) of G, i.e., 

MLA(G) = Layout(G; E, S) = min V \E,{'k)\ 

TV ^—^ 

ie[n] 

MCLA(G) = Layout(G; max) = minmax |£;j(7r)|. 

1" [n] 

Theorem 4.1. For every e > 0, given a graph G = iy,E), it is SSE-hard to distinguish between: 
Yes MLA(G) < 0(e • \V\ ■ \E\) and MCLA(G) < 0{e\E\) 

No For every S with \V\/W < \S\ < 9|y|/10, it holds that \E{S,V \ S)\ > n{^/e\E\). In 

particular, MLA(G) > il{^fe ■ \V\ ■ \E\) and MCLA(G) > n{'^e\E\). 



Proof. We use the instances for Conjecture 2.19| with q = 1/e. Let G = {V,E) be an instance for 



Conjecture 2.19 



In the Yes case, we have disjoint sets Si, ■ ■ ■ , Sq and for each set Sj, \Sj\ = n/q = en, ^ciSj) < 
2e. We give an ordering vr of the vertices such that maxjg[„] |£'j(7r)| < 3e|S| as follows. Order the 
vertices as Si, Sq (with the order within each Sj chosen arbitrary) and let this order be vr. For 
any i S [n], we show that |-E'j(7r)| < 3e|ii^|. Suppose ir^^ii) is a vertex in Sj. Each edge in Ei{7r) 
either has both end-points inside Sj, or its end-points in two different S^s. The total number of 



edges inside Sj is at most edn/2 = €\E\. Moreover, by Remark 2.20, the total number of edges with 
end-points in two different S^s is at most 2e|i?|. Therefore, l\/ICLA(G) < maxj |£'j(7r)| < 3e|£'j(7r)|. 
The MLA value can be bounded similarly. 



The property of the No instance is the same as in Conjecture 2.19 (via Remark 2.21), and the 



implications for the MLA and MCLA values are immediate. □ 



4.2 Reduction To Directed Graphs 

Given an undirected graph G = {V,E), we construct a directed graph D = {V',E') as follows. In 
order to distinguish the elements of V and E from the elements of V and E' , we refer to elements 
of V as vertices, elements of E as edges, elements of V' as nodes, and elements of E' as arcs. 

There is a node in D for each vertex and for each edge of G, i.e., V = V L) E. The graph D is 
bipartite with bipartition V, E, and there is an arc in D from e £ E to v €z V if e is incident upon 
V. Formally, 

V' = VUE 

E' = {{e,v)\e G E,v gV,v G e}. 
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See also Figure [2| 

The remainder of this section is devoted to analyzing the reduction. First, it is easy to give an 
upper bound on the four Layout values of D in terms of the MLA and MCLA values of G. 

Lemma 4.2. The DAG D constructed from G as above satisfies the following: 

Layout(D; E, S) < (MLA(G) + 0{\E\)) • (d + 1) 
Layout(L>; E, max) < MCLA(G) + d. 



Note that, for the purposes of applying this to the graphs of Theorem 4.1 the error term of 
(resp. d) is insignifcant compared to the MLA (resp. MCLA) value of G. 

Proof. Consider an ordering vr of V . For a set of vertices S of V , let Ut^{S) € S denote the vertex 
of S that comes first in the ordering vr. 

We extend vr to an ordering vr' of V by inserting each edge e = {u,v) immediately before the 
vertex ^^(e). It is easy to see that for each node z G V L) E' , 

\E,{7^')\ < \E^^(^,){7r)\ + d 

This immediately implies 

LayoutfD; i?, max) < max lE'xfvr')! < max lE'^jfvr)! + d, 

Setting vr to be an optimal MCLA ordering of G, we obtain the second claim of the Lemma. 
Similarly, using that |u~^(w)| < d + 1 for every v & V, we get 

Layout(D;^,S) < ^ \E,{7,')\ < (d + 1) ^ |^4vr)| + 
zev u&v 

Setting vr to be an optimal MLA ordering of G and using \V'\ = 0{\E\), we obtain the first claim 
of the Lemma. □ 



Next we use the strong soundness property of Theorem 4.1 to argue about the soundness of D 
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Lemma 4.3. Suppose G has the property that for every |1^|/10 < |5| < 9|y|/10 we have \E{S,V\ 
5)1 > n{^e\E\). Then, 

Layout(L';y,S) > n{^€\E\^) 
Layout(L>; y, max) > VL[^ft\E\) 



Proof. Let vr' be any ordering of V . Using the expansion property of Theorem 4.1 , we'h show that 
this ordering must have high cost. For a point i S [A^], let Si be the set of vertices of V that appear 
after i in tt'. 

The bound on Layout(D; max) is immediate: consider a point i G [N] such that \Si\ = \V\/2. 
By the expansion property \E{Si,V \ Si)\ > ^l{^/€\E\), and since each such edge e has one of 
its endpoints before point i, the node e itself must appear before point i and thus |Fi(7r')| > 
\E{Si,V\Si)\>n{^e\E\). 

Let us then turn to Layout(Z); V, S). Write Cj for the fraction of edges e that appear before (or 
at) point i in vr'. We shall show that whenever 1/5 < Cj < 4/5, we have |Vi(7r')| > ^l{^/€\E\), giving 
a total of Layoutp; V, S) > n{./^\E\'^). 

By a simple counting argument, we have 

d\Si\ > 2{l -Ci)\E\, 

implying \Si\ > (1 — Cj)|y| which for a < 4/5 is at least a 1/5 fraction of vertices. If in addition 
I'S'il < 9|F|/10, the argument above gives |Vi(7r')| > Q{^/e\E\). The remaining case is that \Si\ > 
9|y|/10. But then Si is incident upon at least a 9/10 fraction of edges. This implies that the 
number of edges incident upon Si, appearing before i in vr', are at least I-EKq — 1/10) which for 
Ci>l/5isn{^\E\). □ 



Combining Lemma[4.2|and Lemma 4.3 with Theorem |4.1[ and using the fact that edge costs are 



always larger than the corresponding vertex costs, we immediately obtain the following theorem. 

Theorem 4.4. Given a DAG D, Layout(D; i?, max), Layout(L'; i?, S), Layout(D; y, max), and 
Layout(L>; y, S) are all SSE-hard to approximate within any constant factor, even in DAG's with 
maximum path length 1 (i.e., every vertex is a source or a sink). 



Remark 4.5. In fact we see that, as in Theorem ^.1, the four hardness results applies to the same 



instance, so that it is SSE-hard to distinguish all of the four Layout values being high from all of 
them being low. 

As the one-shot black pebbling problem is precisely Layout(-D; y, max), we obtain hardness for 
one-shot black pebbling as an immediate corollary. However, the instances are not single-sink DAGs 
with maximum indegree 2, as promised in Theorem 1.2 In Section [6] we show how to transform 
the instances further to obtain such DAGs. 



4.3 Undirected Vertex Problems 

The reduction for undirected vertex problems is very similar to the reduction for directed problems 
given in the previous section. As before, we introduce nodes for every edge of G. As in the directed 
case, we are interested in orderings where an edge appears before its two endpoints, but we cannot 
use direction to force this anymore. Instead, we ensure that orderings that are not like this incur 
a high cost by replicating each node corresponding to a vertex of G many times. 
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Figure 3: The reduction from G to G' , illustrated for r = 3. 



Given an undirected graph G = {V, E), we construct a new graph G' = {V , E') as follows. 

There are r nodes in G' for each vertex and one node for each edge of G, i.e., = ^ x [r] U E. 
For a vertex u ^ V we write u^, . . . to denote the r copies of u and refer to each such set of r 
nodes as a vertex group. The graph G' is bipartite with bipartition V x [r], E, and there is an edge 
in G' between e £ E and f * G y x [r] if e is incident upon v. Formally, 

V' = {v'\v eV,ie[r]} 

E' = {{e,v')\e e E,v e V,v e e,i £ [r]}. 

See also Figure [3} 

Lemma 4.6. The graph G' constructed from G as above satisfies the following: 

Layout(G'; V, S) < (d + r) MLA(G) 
Layout(G';y,max) < MCLA(G'). 



Proof. We proceed as in the proof of Lemma 4.2 An ordering noiV naturally induces an ordering 



it' of V: put all r copies of u £ V consecutively, with vertices of V appearing in the same order as 
in vr, and insert each edge e G E immediately before its first vertex. Again, for an edge e G E, let 
u-,riz) denote the endpoint of e that appears first in vr. Similarly, for a copy v"^ £ V of v £ V, let 
u-niv^) = V. It is easy to see that the constructed ordering vr' satisfies 

|T4(vr')| < |i^n.{.)(vr)| 

for every z £V' . This immediately implies 

Layout (G' ; y, max) < max iT^fTi"')! ^ max |£'u(7r)|. 

Similarly, we get 

Layout(G'; F, S) < ^ |y,(7r')| < (d + r) | (vr) | . 

zev u&v 

□ 
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Lemma 4.7. Suppose G has the property that for every |1^|/10 < |5| < 9|y|/10 we have \E{S,V\ 
5)1 > VL{^e\E\). Then, if r > \V\ ■ \E\, we have 

Layout(G'; V, S) > n{-^e ■r-\V\- \E\) 
Layout(G';y,max) > n{^e\E\) 

Proof. Let tt' be an ordering of V . First we have the following simple claim, establishing that for 
good orderings, most vertices appear after their edges. 

Claim 4.8. Suppose that for some vertex u £V, at least r/2 of the copies of u in G' appear before 
some edge e = {u,v) €z E adjacent upon u. Then 

max |V-(7r')| > r/4 > Q{,/^ • \E\) 
ie[N] 

|V-(7r')| > (r/4)2 » • r • |y| • 

ielN] 

Proof. Let Ii be the first half of the positions where copies of u appear before e, and I2 the second 
half. Thus, |/i|,|/2| > r/4. Then each element of Ii contributes to l^(vr') for each i S I2, giving 
the claimed bounds. □ 

Thus we may without loss of generality assume that for each vertex u of V, at least r/2 of its 
r copies in G' appear after all edges adjacent upon u. Prom now on, let us discard all the < r/2 
"bad" copies of each vertex of V that appear before some of its edges. This only decreases the cost 
of tt' , and there are still > r|V|/2 vertex nodes left. 

Let ii be the (first) point of vr' such that r|V|/10 vertex nodes are to the left of ii, and Z2 the 
(last) point of tt' such that r|y|/10 vertex nodes are to the right of 12- 

Claim 4.9. For any point i between ii and i2, we have |T^(7r')| > i}{^/€\E\). 

Proof. Let S C V (resp. T C y) be the set of vertices u such that some copy of u appears before 
i (resp. after i). We then have |r| > ^^^^j^'*''^ > |T/|/5, and S U T = V. Thus we can partition 
V into S" C 5, r' C T such that |5'|,|T'| < 4|y|/5. By the expansion property of G we have 
\E(S',T')\ > n{^/e\E\). Further, we also have \Vi{TT')\ > \E{S',T')\ as each e G E{S',T') must 
appear before i in vr' (because one of their endpoints is in S) but have an edge crossing i (because 
the other of their endpoints is in T). □ 



From Claim 4.9, the proof of the lemma follows immediately. □ 



As in the previous section, we can now combine Lemma 4.6 and Lemma 4.7, with Theorem 4.1 
to obtain: 

Theorem 4.10. Given a graph G, Layout(G; y, max), Layout(G; y, S) are both SSE-hard to ap- 
proximate within any constant factor, even in bipartite graphs. 

As the pathwidth problem is precisely Layout(G; F, max), we obtain hardness for pathwidth 
as an immediate corollary. In the next section, we'll show the stronger soundness required for 
Theorem II. 1[ 
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5 Hardness For Treewidth 



In this section we shall complete our proof of Theorem |1.1| by showing that the hard instances for 



pathwidth from Theorem 4.10 also have large treewidth. 



Lemma 5.1. Let G = {V^E) he an undirected graph with the property that for every |y|/10 < 
\S\ < 9|y|/10 we have \E{S, V \ S)\ > ^}{^/e\E\), and let G' he the graph obtained hy applying the 
reduction of Section \4-^ to G. Then, if r > \V\ ■ \E\, we have 

tw(G') > n{^/e\E\) 



To prove Lemma 5.1 , we shall use the fact that the treewidth of a graph is closely related to an 



expansion-like property called the 1/2-separator number, defined in [B GHK95j . 

Definition 5.2 (1/2-vertex separator, 1/2-separator number). Let G = {V,E) be an undirected 
graph. For C y, a 1/2-vertex separator of in G is a set C y of vertices such that every 
connected component of the graph G[V — S] contains at most \W\/2 vertices of TV. Let iPg{1/2,W) 
denote the minimum size of a 1/2-vertex separator of W in G. We define the 1/2-separator numher 
Ki/2{G) to be 

Ky,{G) = m^x^PG{l/2,W). 
wcv 

Lemma 5.3 ((BGHK95]). For every graph G = {V,E), it holds that tw(G) > Ki/2{G) - 1. 
Using this, it is now straightforward to prove the lower bound on the treewidth. 



Proof of Lemma 5J_, We'll show that ipG'i^f^, ^ ^(v^l-^l) we choose W = V). Suppose 
G is an optimal 1/2-vertex separator of V and it separates V \G into / sets F/, . . . , V/, each of 
size at most |y|/2. By merging different V- we may assume that we only have two sets V( and V^', 
both of size at least \V'\/5. 



Now, similarly to the proof of Claim 4.9, let 5 C y (resp. T Q V) he the set of vertices v such 
that some copy of V appears in V( (resp. V2). As \V{\, \V2\ > r|y|/5, this implies that both |r| 
are at least 1^1/5, and furthermore SUT = V (since otherwise all r copies of some vertex are in C, 
implying |C| > r 3> 0,{^/e\E\)). We can thus choose a balanced partition S',T' such that S' C S, 
T' C T, and we have \e\s' , T')\ > n{y/^\E\). But every edge e = (u, v) such that u e S' and v € T' 
must belong to C, since it is connected (in G') to every copy of u and v. □ 



6 Nicer Pebbling Instances 

In this section we show how to transform our hard instances for one-shot black pebbling so as to 
have in-degree bounded by 2 and single sinks. 
We begin with the in-degree. 

Lemma 6.1. Given a DAG D = {V,E) we can in polynomial time construct a DAG D' = (V',E') 
such that every node of D' has in-degree at most 2 and 

BP^^{D) < BP^'{D') < BP^'{D) + d, 

where d is the maximum in-degree of D. 
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Figure 4: Pyramid of size 4 



As the proof of this lemma is somewhat lengthy, we defer it until later in this section and first 
describe how to obtain a DAG with a single sink. 

Lemma 6.2. Given a DAG D = {V,E) we can in polynomial time construct a DAG D' = {V',E') 
such that D' has a single sink and 

BP^^D) < BP^'{D') < BP^'{D) + s + 1, 

where s is the number of sinks in D. Furthermore, if D has maximum in-degree 2 then so does D' . 

Proof. Construct D' by adding a binary tree with s leaves to D, and identifying the leaves of the 
tree with the sinks of D. The properties of D' are easily verified. Since D' is a super-DAG of D, 
its pebbling cost must be at least BP^^(L'). Conversely, a valid pebbling of D' can be obtained by 
using a one-shot pebbling of D but without removing pebbles from the sinks, and then pebbling 
the tree. □ 



Given Lemmas 6.1 and 6.2 we can derive Theorem 1.2 The point is that for the graphs produced 
in Theorem 4.4, the maximum indegree d equals the degree of the original SSE instance, and the 
number of sinks s equals the number of vertices of the original pebbling instance. On the other 
hand, the one-shot black pebbling cost (i.e., the Layout(-D; y^,max) value) is of order e\E\ which is 
much larger than d and s, so the additive losses of Lemmas |6.1| and |6.2| are insignificant. We omit 
the details. 

6.1 Lemma 16.11 



In this section we prove Lemma 6.1 First, recall the definition of a pyramid graph. 



Definition 6.3. A pyramid graph of size d is a layered graph of indegree two, with d layers, labelled 
0, 1, . . . , d — 1. Layer zero (the input layer) consists of d vertices, and layer i contains d — i vertices. 
See Figure |4) 



The reduction of Lemma 6.1 to produce DAGs of indegree 2 is as follows. Construct D' by 
replacing each vertex n by a pyramid Pu of size d{u) (here, d{u) denotes the indegree of n), where 
the d{u) vertices at layer of Pu are identified with the predecessors of n, and u is identified with 
the vertex at layer d{u) — 1 of Pu- See Figure [sj 

To prove the lemma we need to show that D' constructed this way satisfies 

BP^\D) < BP^^D') < BP^^D') + d. 
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Figure 5: Reduction to DAGs of indegree 2 



where d is the maximum indegree of any vertex u of D. 

In what follows whenever we say "pebbling strategy" of D or D' we always refer to a one-shot 
black pebbling strategy of D oy D' . 

The upper bound on BP^^(Z)') is trivial: if 5 is a valid pebbling strategy for D, then clearly we 
can create a corresponding pebbling strategy for D' by pebbling through the pyramid whenever D 
pebbles the sink of the pyramid. This takes at most d additional pebbles. 

In the other direction, we want to show that a pebbling strategy for D' can be converted into 
a pebbling strategy for D. We will first show that D' can be assumed to be in a particular normal 
form, and then using this normal form, we will show how to simulate the pebbling. 

Definition 6.4. Let S' be a pebbling strategy of D' . That is, S' is a sequence of configurations, 
where each configuration is a set of black pebble placements, and such that the sequence of con- 
figurations follows by the black pebbling rules. We say that configuration c G 5' is saturated with 
respect to a pyramid P„ if c is the first time in S' that there is a black pebble path cutting the sink 
of Pu from all of the sources of Pu- (The cut does not include any sources or the sink of P^.) Note 
that this cut has size d—l. 

Claim 6.5. Let S' he a pebbling strategy for D' . We can assume without loss of generality that 
D' has the following normal form. For each configuration c' £ S' , if c' is saturated with respect to 
pyramid Pu, then the subsequent moves of S' pebble the sink of Pu (in the obvious way), removing 
all other black pebbles on the internal nodes of Pu ■ 

Proof Sketch. At a saturated configuration c', there must be d — 1 pebbles on internal nodes of 
Pu- If we subsequently pebble the sink of p, we will never use more than d—l pebbles on internal 
nodes of p, and all other pebbles on the graph stay as they were. Thus the normal form does not 
use more pebbles than the original strategy. Furthermore, since the internal nodes of a pyramid 
are only used to pebble the sink of this pyramid, we have not lost anything by pebbling through 
to the sink and removing the other internal black pebbles. □ 

From now on we will assume that the pebbling S' of D' has the above normal form. That is, 

if a configuration is saturated (with respect to a pyramid Pu), the next thing that happens in S' 
is to pebble the sink of Pu- (After pebbling the sink, we will have not touched whatever pebbles 
were on the source nodes of Pu, and we will have a pebble on the sink node of Pu, and no other 
internal pebbles on P„.) 

Our strategy for constructing a pebbling, S, of D, given a normal form pebbling, S' of D' is as 
follows. For each node v of D, pebble v whenever it is first pebbled in S' , and remove the pebble 
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from V as soon as all successors of v (in the original graph D) are pebbled. We want to argue that 
this pebbling strategy of D is not greater than that of D' . To see this, we will use the following 
Lemma. 

Lemma 6.6. In any frugal read-once black strategy of a size d pyramid, the number of pebbles on 
the pyramid at any point in time, up until all sources are pebbled for the first time, must be equal 
to the number of sources in that pyramid that have been pebbled so far. 

Assuming the above Lemma it is clear that if S' is a normal form pebbling of D' , then for 
any pyramid p in D', and any configuration c', if there are k pebbles on at c', then in the 
corresponding configuration c of D, there are at most k pebbles on source nodes of Pu- To see this, 
first notice that by the above Lemma, anytime a pyramid is being pebbled in D' up until the time 
when all source nodes of the pyramid are pebbled for the first time, the number of pebbles on the 
pyramid will be at least as large as the number of source nodes in D that contain pebbles. Then 
by the normal form property of D\ as soon as all source nodes of D' are pebbled for the first time, 
the strategy pebbles the sink of D' , and thus again the number of corresponding pebbles on D is 
never greater than the number of pebbles on D' . 



Proof of Lemma 6.6. Let P be a size d pyramid graph, and let S a one-shot black pebbling of P. 
Let c be a configuration occuring in S such that the set of source nodes that have been pebbled 
up to c are the source nodes of P', where P' is a size d' sub- pyramid of P. We want to argue 
that c must contain at least d' pebbles. Assume without loss of generality that P' is the leftmost 
sub-pyramid of P, of size d' < d. Label the outer rightmost vertices of P' by Vd-i, ■■■,vq, where 
Vd-i is the sink vertex of P' , and for all i < d — 1, Vi is the rightmost vertex in P' at level i. 
Corresponding to each named vertex Vi is a diagonal set of vertices, diag{vi), beginning at Vi and 
travelling southwest to a source vertex of P'. Note that the sets diag{vi) are pairwise disjoint. We 
will argue that for each i, < i < d — 1, at least one vertex from diag{vi) must appear in c. To see 
this, first notice that for each Vi, there is a vertex v'^ that is an immediate successor of Vi and that 
lies outside of P' . This vertex v'^ must be pebbled at some time after configuration c, since it has a 
predecessor that has not yet been pebbled. But in order to pebble v'^ in the future, there must be 
a black pebble on some vertex in diag{vi) in c. Thus, we have shown that if c is any configuration 
in S such that d' < d source vertices are pebbled thus far, then there must be d' vertices pebbled 
in c. □ 



7 Conclusion and Open Problems 

We proved SSE-hardness of approximation for a variety of graph problems. Most importantly we 
obtained the first inapproximability result for the treewidth problem. 

Some remarks are in order. The status of the SSE conjecture is, at this point in time, very 
uncertain, and our results should therefore not be taken as absolute evidence that there is no 
polynomial time approximation for (e.g.) treewidth. However, at the very least, our results do give 
an indication of the difficulty involved in obtaining such an algorithm for treewidth, and builds 
a connection between these two important problems. We also find it remarkable how simple our 
reductions and proofs are. We leave the choice of whether to view this as a healthy sign of strength 
of the SSE Conjecture, or whether to view it as an indication that the conjecture is too strong, to 
the reader. 

There are many important open questions and natural avenues for further work, including: 
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1. It seems plausible that these results can be extended to a wider range of graph layout prob- 
lems. For instance, our two choices of aggregators max and S can be viewed as taking 
and i\ norms, and it seems likely that the results would apply for any 1^ norm (though we 
are not aware of any previous literature studying such variants). 

2. It would be nice to obtain hardness of approximation result for our problems based on a weaker 
hardness assumption such as UGC. It is conjectured in jRSTlO] that the SSE conjecture is 
equivalent to UGC. Alternatively, it would be nice to show that hardness of some of our 
problems imply hardness for the SSE Problem. 

3. For pebbling, it would be very interesting to obtain results for the unrestricted pebbling 
problems (for which finding the exact pebbling cost is even PSPACE-hard). As far as we are 
aware, nothing is known for these problems, not even, say, whether one can obtain a non- 
trivial approximation in NP. As mentioned in the introduction, we are currently working on 
extending our one-shot pebbling results to bounded time pebblings. We have some preliminary 
progress there and are hopeful that we can relax the pebbling results to a much larger class 
of pebblings. 
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